Towards Automatic Music Structural Analysis: Identifying Characteristic Within-Song Excerpts in Popular Music

نویسندگان

  • Bee Suan Ong
  • Xavier Serra
چکیده

Automatic audio content analysis is a general research area in which algorithms are developed to allow computer systems to understand the content of digital audio signals for further exploitations. The main focus therein is on the practical applications for audio files management, like automatic labeling, efficient browsing, or the retrieval of relevant files with little effort from a big database. Automatic music structural analysis is a specific subset of audio content analysis in which the domain of audio content is restricted to the semantically meaningful descriptions of audio in a musical context. The main task of automatic music structural analysis is to discover the structure of music by analyzing audio signals in order to facilitate a better handling of the current explosively expanding amounts of audio data available in digital collections. In this research work, we focus our investigation on two areas that are part of audio-based music structural analysis. First, we propose a unique framework and method for temporal audio segmentation at the semantic level. The system aims to detect the structural changes in music to provide a way to separate the different " sections " of a piece according to its structural titles (i.e. intro, verse, chorus, bridge, etc). We present a two-phase music segmentation system together with a combined set of low-level audio descriptors to be extracted form the music audio signals. Contrary to existing approaches, we consider the applicability of image processing methods in audio content analysis. A database of 54 audio files (The Beatles' song) is used for the evaluation of the proposed approach on a mainstream popular music collection. The experiment results demonstrate that our proposed algorithm has achieved 71% of 3 accuracy and 79% of reliability in a practical application for identifying structural boundaries in music audio signals. Secondly, we present our proposed framework and approach for the identification of representative excerpts from music audio signals. The system aims to extract a short abstract that serves as a 'hook' or thumbnail of the music and generates a retrieval cue from the original audio files. Instead of simply pursuing the present literature that mainly accentuates the repetitiveness of audio excerpts in the identification task, we also investigate the potential of audio descriptors in capturing specific characteristics of the representative excerpts. A database of 28 music tracks that comprises popular songs from various artists is used to evaluate the performance of our identification system. By …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analaysis of IFLA Library Refrence Model’s Entities and Attrbutes For Iranian Traditional Music Resources (Case study: Morq-e sahar Song)

Background and Aim: The object of the  study was to Analyze IFLA Library Reference Model (LRM) Entities and Attributes for Iranian Traditional Music Resources, Case Study: Morq-e Sahar Song. Method: The  study inherits an applied content analysis method. All Entities and Attributes of  IFlA LRM Model based on  two checklists include:  Final report of IFlA LRM on August 2017 and Transition Mappi...

متن کامل

Automatic Music Summarization via Similarity Analysis Automatic Music Summarization via Similarity Analysis

We present methods for automatically producing summary excerpts or thumbnails of music. To find the most representative excerpt, we maximize the average segment similarity to the entire work. After window-based audio parameterization, a quantitative similarity measure is calculated between every pair of windows, and the results are embedded in a 2-D similarity matrix. Summing the similarity mat...

متن کامل

Computing Structural Descriptions of Music through the Identification of Representative Excerpts from Audio Files

With the rapid growth of audio databases, many music retrieval applications have employed metadata descriptions to facilitate better handling of huge databases. Music structure creates the uniqueness identity for each music piece. Therefore, structural description is capable of providing a powerful way of interacting with audio content, and serves as a linkage between low-level description and ...

متن کامل

Automatic Audio Segmentation: Segment Boundary and Structure Detection in Popular Music

Automatic Audio Segmentation aims at extracting information on a song’s structure, i.e., segment boundaries, musical form and semantic labels like verse, chorus, bridge etc. This information can be used to create representative song excerpts or summaries, to facilitate browsing in large music collections or to improve results of subsequent music processing applications like, e.g., query by humm...

متن کامل

The Role of Emotion and Context in Musical Preference

The powerful emotional effects of music increasingly attract the attention of music information retrieval researchers and music psychologists. In the past decades, a gap exists between these two disciplines, and researchers have focused on different aspects of emotion in music. Music information retrieval researchers are concerned with computational tasks such as the classification of music by ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005